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1 

Method and Apparatus for transmitting watermark data bits 
using a spread spectrum, and for regaining watermark data 
bits embedded in a spread spectrum 

The invention relates to a method and an apparatus for 
transmitting watermark data bits using a spread spectrum, 
and to a method and an apparatus for regaining watermark 
data bits embedded in a spread spectrum. 



Background 

•Watermarking- means imperceptible insertion of information 
into multimedia data, e.g. audio data and/or video data. The 
insertion of additional information data, such as a number 
or a text, into multimedia data is performed through slight 
modification of the original multimedia data. Watermarking 
can be used for e.g. copyright protection, labelling (e.g. 
URL of a site or a site's logo), monitoring, tamper proof- 
ing, or conditional access. 

Applying 'spread spectrum' in a (RF) communications system, 
means that a small baseband signal bandwidth is intention-' 
ally spread over a larger bandwidth by injecting or adding a 
higher- frequency signal, or spreading function. As a direct 
consequence, the energy used for transmitting the signal is 
spread over a wider bandwidth, and appears as noise. 
Spread spectrum technology and the related inserted or added 
information signal can be used for implementing watermarking 
of e.g. digital audio signals, whereby the spread spectrum 
can use the complete audio spectrum from OHz to one half of 
the sampling frequency. This spectrum carries the informa- 
tion of one bit. In a modification of such systems shorter 
spread spectrum sequences are used leading to band limited 
spread spectrum signals, so that several ones of the band 
limited spread spectrum signals can be added at different 
centre frequencies to the audio spectrum, at which centre 
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frequencies the original audio signal has been notch fil- 
tered, in order to increase the bitrate of the watermark 
signals and/or to prevent attacks on the watermarked sig- 
nals. In this watermark system the spread spectrum signals 
are modulated on a carrier. 

A kj.own processing for retrieving at receiver or decoder 
side the watermark signal information bit from the spread 
spectrum is convolving the received or replayed spectrum 
with a spreading function that is time- inverse with respect 
to the original spreading function, which kind of processing 
is also called 'applying a matched filter'. If BPSK modula- 
tion was used for applying the spread spectrum fianction, the 
output of this process is a peak at the middle of the se- 
quence of correlation values, whereby the sign of such peak 
represents the value of the desired watermark signal infor- 
mation bit, c.f. Fig. 5 which shows a negative peak in the 
convolution result. If QPSK was used two peaks will be pre- 
sent in the sequence of correlation values, whereby each 
peak represents one bit value. 



Invention 

This decoding processing works fine in case undisturbed sig- 
nal are received by the decoder. However, if the received 
input signals contain multipath or echo or reverberation 
distortions, the convolved output signals will contain more 
than one peak per watermark signal information bit (i.e. per 
convolution result) to be decoded so that, e.g. depending on 
the amplitude or power of the distortion peaks, it is diffi- 
cult or in many cases even impossible to retrieve the cor- 
rect watermark information bits. 

A problem to be solved by the invention is to increase the 
robustness of spread spectrum systems against echo and re- 
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verberation distortions, and to reduce the number of errone 
ously demodulated watermark signal information bits. This 
problem is solved by the methods disclosed in claims 1 to 3 
Corresponding apparatuses which utilise these methods are 
disclosed in claims 6 to 8, respectively. 

In a first embodiment of the invention, two or more orthogo- 
nal spreading sequences or functions are combined at trans- 
mitter or encoder or source side with the original or en- 
coded audio signal in baseband, i.e. without modulating the 
spreading sequences or functions on a carrier before combin- 
ing them with the original or encoded audio signal. 'Or- 
thogonal' spreading sequences or functions means that the 
cross-correlation of such sequences yields a zero-value re- 
sult, or a very small-value result. When applying the corre- 
sponding time- inverse orthogonal spreading sequences or 
functions at receiver or decoder side, echoes that are 
longer than each one of spreading sequence's or function's 
lengths can be fully removed. 

In a second embodiment of the invention the time-inverse 
versions of not necessarily orthogonal spreading sequences 
or functions are modified at receiver or decoder side ac- 
cording to pre-known or estimated echo delay values and fad- 
ing parameters. In case of estimated echo delay values the 
delay time period measurements/calculations can be repeated 
for several succeeding audio signal frames before a valid 
delay time period value is formed. 

Advantageously the number of watermark signal bit errors due 
to echoes caused by multipath or reverberated reception con- 
ditions is substantially decreased. 

The features of the first and second embodiment can also be 
combined in that two or. more orthogonal baseband spreading 
sequences or functions are used which are being modified at 
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decoder side according to echo delay values and fading pa- 
rameters . 

in principle, the inventive method is suited for transmit- 
ting watermark data bits using a spread spectrum, said 
method including the steps: 

- Modulating said watermark data bits on an encoder pseudo- 
noise sequence; 

- Transfoarming said modulated encoder pseudo-noise sequence 
into the frequency domain and shaping it in amplitude ac- 
cording to the masking level curve of an audio signal to- 
gether with which the watermark data bit information is to 
be transmitted or transferred, and transforming said shaped 
encoder pseudo-noise frequency domain sequence back into the 
time domain; 

- Combining said inverse transformed encoder pseudo-noise 
frequency domain sequence with a current frame of data of 
said audio signal; 

- Transmitting or transferring said combined audio signal 
frame or frames carrying said watermark data bits, 
wherein the length of said encoder pseudo-noise sequence is 
one Nth of the length of a frame of said audio signal, N be- 
ing an integer number greater one, and wherein N orthogonal 
encoder pseudo-noise sequences are used per frame of said 
audio signal for carrying out said combining for correspond- 
ing sections of a current frame. 

In principle, the inventive method is suited for regaining 
watermark data bits embedded in a spread spectrum, whereby 
the corresponding original watermark data bits were modu- 
lated at encoder side on an encoder pseudo-noise sequence 
and said modulated encoder pseudo-noise sequence was trans- 
formed into the frequency domain and shaped in amplitude ac- 
cording to the masking level curve of an audio signal to- 
gether with which the watermark data bit information was 
transmitted or transferred, and said shaped encoder pseudo- 
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noise frequency domain sequence was transformed back into 
the time domain and was combined with a current frame of 
data of said audio signal, wherein the length of said en- 
coder pseudo -noise sequence was one Nth of the length of a 
frame of said audio signal, N being an integer number 
greater one, wherein N orthogonal encoder pseudo-noise se- 
quences were used per frame of said audio signal for carry- 
ing out said combining for corresponding sections of a cur- 
rent frame, said method including the steps: 

Receiving and synchronising said transmitted or trans- 
ferred audio signal; 

Convolving each one of a corresponding section of said 
current frame of data of said audio signal with the corre- 
sponding one of time-inversed versions of the N orthogonal 
encoder pseudo-noise sequences; 

- Deteirmining, for each one of said sections, from the sign 
of the peak or peaks of the corresponding convolution result 
the value of a bit of said watermark data. 

In principle, the inventive method is also suited for re- 
gaining watermark data bits embedded in a spread spectrum, 
whereby the corresponding original watermark data bits were 
modulated at encoder side on an encoder pseudo-noise se- 
quence and said modulated encoder pseudo-noise sequence was 
transformed into the frequency domain and shaped in ampli- 
tude according to the masking level curve of an audio signal 
together with which the watermark data bit information was 
transmitted or transferred, and said shaped encoder pseudo- 
noise frequency domain sequence was transformed back into 
the time domain and was combined with a current frame of 
data of said audio signal, wherein the length of said en- 
coder pseudo-noise sequence corresponded to the length of a 
frame of said audio signal and said encoder pseudo-noise se- 
quence was used for carrying out said combining for a cur- 
rent frame, said method including the steps: 
- Receiving and synchronising said transmitted or trans- 
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f erred audio signal; 

Determining in the received audio signal one or more ech 
oes and the related echo delays; 

Constructing a modified decoder pseudo-noise sequence 
5 based on the time-inversed version of said encoder pseudo- 
noise sequence whereby, according to the echo delay or de- 
lays determined, correspondingly time-shifted versions of 
said time-inversed encoder pseudo-noise sequence are com- 
bined in order to construct said modified decoder pseudo- 
10 noise sequence; 

Convolving said current frame of data of said audio sig- 
nal with said modified decoder pseudo-noise sequence; 

Determining from the sign of the peak or peaks of the 
convolution result the value of a bit of said watermark 
15 data. 

In principle the inventive apparatus is suited for transmit 
ting watermark data bits using a spread spectrum, said appa 
ratus including: 

20 - Means for modulating said watermark data bits on an en- 
coder pseudo- noise sec[uence; 

Means for transforming said modulated encoder pseudo- 
noise sequence into the frequency domain and for shaping it 
in amplitude according to the masking level curve of an au- 

25 dio signal together with which the watermark data bit infor 
mat ion is to be transmitted or transferred, and for trans- 
forming said shaped encoder pseudo-noise frequency domain 
sequence back into the time domain; 

Means for combining said inverse transformed encoder 

3 0 pseudo-noise frequency domain sequence with a current frame 
of data of said audio signal; 

Means for transmitting or transferring said combined au- 
dio signal frame or frames carrying said watermark data 
bits, 

35 wherein the length of said encoder pseudo-noise sequence 

is one Nth of the length of a frame of said audio signal, N 
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being an integer number greater one, wherein N orthogonal 
encoder pseudo-noise sequences are used per frame of said 
audio signal for carrying out said combining for correspond- 
ing sections of a current frame. 



In principle the inventive apparatus is suited for regaining 
watermark data bits embedded in a spread spectrum, whereby 
the corresponding original watermark data bits were modu- 
lated at encoder side on an encoder pseudo-noise sequence 
and said modulated encoder pseudo-noise sequence was trans- 
formed into the frequency domain and shaped in amplitude ac- 
cording to the masking level curve of an audio signal to- 
gether with which the watermark data bit information was 
transmitted or transferred, and said shaped encoder pseudo- 
noise frequency domain sequence was transformed back into 
the time domain and was combined with a current frame of 
data of said audio signal, wherein the length of said en- 
coder pseudo- noise sequence was one Nth of the length of a 
frame of said audio signal, N being an integer number 
greater one, wherein N orthogonal encoder pseudo-noise se- 
quences were used per frame of said audio signal for carry- 
ing out said combining for corresponding sections of a cur- 
rent frame, said apparatus including: 

- Means for receiving and synchronising said transmitted or 
transferred audio signal; 

- Means for convolving each one of a corresponding section 
of said current frame of data of said audio signal with the 
corresponding one of time-inversed versions of the N or- 
thogonal encoder pseudo-noise sequences, and for determin- 
ing, for each one of said sections, from the sign of the 
peak or peaks of the corresponding convolution result the 
value of a bit of said watermark data. 

In principle the inventive apparatus is suited for regaining 
watermark data bits embedded in a spread spectrum, whereby 
the corresponding original watermark data bits were modu- 
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lated at encoder side on an encoder pseudo-noise sequence 
and said modulated encoder pseudo-noise sequence was trans- 
formed into the frequency domain and shaped in amplitude ac- 
cording to the masking level curve of an audio signal to- 
5 gether with which the watermark data bit information was 

transmitted or transferred, and said shaped encoder pseudo- 
noise frequency domain sequence was transformed back into 
the time domain and was combined with a current frame of 
data of said audio signal, wherein the length of said en- 
10 coder pseudo-noise sequence corresponded to the length of a 
frame of said audio signal and said encoder pseudo-noise se- 
quence was used for carrying out said combining for a cur- 
rent frame, said apparatus including: 

- Means for receiving and synchronising said transmitted or 
15 transferred audio signal; 

- Means for determining in the received audio signal one or 
more echoes and the related echo delays, and for construct- 
ing a modified decoder pseudo -noise sequence based on the 
time-inversed version of said encoder pseudo-noise sequence 

20 whereby, according to the echo delay or delays determined, 
correspondingly time -shifted versions of said time-inversed 
encoder pseudo-noise sequence are combined in order to con- 
struct said modified decoder pseudo-noise sequence ; 

- Means for convolving said current frame of data of said 
25 audio signal with said modified decoder pseudo-noise se- 
quence, and for determining from the sign of the peak or 
peaks of the convolution result the value of a bit of said 
watermark data. 

30 Advantageous additional embodiments of the invention are 
disclosed in the respective dependent claims. 



Drawings 

35 

Exemplary embodiments of the invention are described with 
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reference to the accompanying drawings, which show in: 

Fig. 1 Watermark signal encoder and watermark signal decoder 

using spread spectrum technique; 
Fig. 2 Watermark signal decoder according to the second 

embodiment ; 

Fig. 3 spread spectrum signal in the decoder following time- 
inverse convolution, showing two successive spreading 
length portions each containing a (positive) bit sig- 
nal, the second portion containing also a (negative) 
echo of the first portion bit signal; 
Fig. 4 The bit signals of Fig. 3 wherein the echo bit signal 
is removed by the inventive features according to the 
first embodiment; 
Fig. 5 spread spectrum signal in the decoder following time- 
xnverse convolution, showing one spreading length 
portion containing a (negative) bit signal without 
echo signal; 

Fig. S The bit signal of Fig. 5 including an echo bit signal- 
Fxg. 7 The bit Signal of Fig. s wherein the echo bit signal' 

is removed by the inventive features according to the 

second embodiment. 



Exemplary embodiment s 



in the watermark signal encoder section in Fig. 1 an origi- 
nal audio input signal AUS is encoded, or processed such 
that the masking level threshold information for an encoding 
xs retrieved, using a psycho -acoustic model calculator 
PSYMC. The resulting masking level threshold information 
MLAUD for the audio data frequency spectrum coefficients 
(resulting e.g. from an FFT or MDCT) of a current audio sig- 
nal input frame are fed together with related control data 
or coding parameters CTRLD to a watermark shaping and embed- 
ding stage WATSE. input watermark data IWATD enter a bit 
value modulation stage BVMOD in which a current bit value of 
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the IWATD data is used to correspondingly modulate a current 
encoder pseudo-noise sequence section ENCPNSEQ_i . For exam- 
ple, if the current bit value is the encoder pseudo- 
noise sequence section ENCPNSEQ_i is left unchanged whereas, 
if the current bit value is '0- or the encoder pseudo- 
noise sequence section ENCPNSEQ_i is inverted. Sequence 
ENCPNSEQ_i consists of e.g. a -random" distribution of ' C 
or and ' +1 ' . If two different sequences ENCPNSEQ_1 and 
ENCPNSEQ_2 are used each of which has a length that is one 
half of the audio data frame length (of e.g. 4096 samples), 
two watermark data bits per audio frame can be transmitted. 
If N different sequences ENCPNSEQ_1 to ENCPNSEQ_N are used, 
each one of them has a length of l/N of the audio data frame 
length, and N watermark data bits per audio frame can be 
transmitted . 

According to the first embodiment of the invention these 
different sequences ENCPNSEQ_1 to ENCPNSEQ_N are orthogonal. 
•Orthogonal' means that any pair of sequences out of the N 
sequences has a cross-correlation that has an output value 
of zero, or a very small output value near zero. According 
to the second embodiment of the invention a single encoder 
pseudo-noise sequence ENCPNSEQ is used. 

The pieces of watermark signals WATS resulting from stage 
BVMOD are combined with, or added to, corresponding frame 
sections of spectral audio data in baseband fashion in the 
watermark shaping and embedding stage WATSE. This is per- 
formed in stage WATSE as follows. A current encoder pseudo- 
noise sequence section ENCPNSEQ_i is transformed into the 
frequency domain. In the frequency domain, this sequence is 
•shaped- according to, i.e. its amplitudes envelope is made 
conforming to, the corresponding frame section masking level 
shape or curve in masking level threshold information MLAUD. 
in case there are two encoder pseudo-noise sequence sections 
per audio frame, sequence ENCPNSEQ_1 is shaped according to 
the masking level shape or curve in the first half of the 
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audio frame and sequence ENCPNSEQ_2 is shaped according to 
the masking level shape or curve in the second half of the 
audio frame. 

Following such shaping the encoder pseudo-noise sequence 
5 section ENCPNSEQ_i is inversely transformed back into the 
time domain. The inversely transformed sequence sections 
ENCPNSEQ_1 to ENCPNSEQ_N are added or combined with the time 
domain sample values of the current audio frame . 
As an alternative, the encoder pseudo-- noise sequence section 
10 ENCPNSEQ_i as shaped in the frequency domain can be combined 
with the frequency domain coefficient values of the current 
audio frame, whereby an encoded audio signal is transmitted 
via channel WATAUTRMCH. 

The output signal of stage WATSE passes through transmitter 
15 stage TRM (which includes e.g. a D/A converter and/or an am- 
plifier) and channel WATAUTRMCH to a watermark signal de- 
coder or receiver. 

Unintended, in the wateirmarked audio transmission channel 
20 WATAUTRMCH a noise or reverberation or echo signal NRE is 
added. This channel can be represented by an acoustic con- 
nection between a loudspeaker and a microphone. 

In the first -embodiment watermark signal decoder section in 
25 Fig. 1 the distorted transmitted signal enters a receiver 
stage REC, wherein e.g. a coarse synchronisation and/or an 
A/D conversion is performed. Its output signal passes 
through a bit or fine synchronisation stage SYNC to a data 
recovery matched filter stage DRECMF, or time-inverse convo- 
30 lution stage DRECMF. This stage convolves, or filters, a 
current incoming audio frame, or a respective section of 
this frame, with a decoder pseudo-noise sequence ENCPNSBQ_i 
that is pre -known by, or stored in, the decoder and is time- 
inverse to the related encoder pseudo-noise sequence section 
35 ENCPNSEQ_i. In case two orthogonal encoder pseudo-noise se- 
quence sections per audio frame were used in the watermark 
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signal encoder, sequence DECPNSEQ__1 is convolved with the 
first half of the currently received audio frame and se- 
quence DECPNSEQ_2 is convolved with the second half of the 
currently received audio frame. 

5 . 

In Fig. 3 corresponding two successive spreading length por- 
tions resulting from this time-inverse convolution are de- 
picted, each containing a (positive 1.0 valued peak) bit 
signal of the transmitted watermark data, the second portion 

10 containing also a (negative peak) echo of the first portion 
bit signal. Basically, a correctly transmitted watermark bit 
'appears' as a peak in the middle of the 2*N-1 intermediate 
correlation results. However, due to echo signals a peak 
could occur at the same or a different position. The related 

15 audio data frame had a length of 4096 samples, therefore the 
correlation with DECPNSEQ__1 and with DECPNSEQ_2 each pro- 
vides the results for 4095 correlation steps. 

According to the invention, after the watermark signal de- 
20 coder receiver part is synchronised, in order to remove a 
negative echo peak signal in the DECPNSEQ_2 correlation re- 
sult, either the 'wrongs position or the smaller amplitude > 
'-1' (or < "+!', respectively) or both facts are used in 
stage DRECMF to not considering such echo peak signals as 
25 valid watermark data bits, or to remove such echo peaks 

leading to a correlation output signal according to Fig. 4. 

Stage DRECMF provides the watermark signal decoder output 
watermark data OWATD which, despite the NRE added on the 
30 transmission channel, correspond 100% or nearly 100% to the 
input watermark data IWATD. 

In the second embodiment of the invention a single encoder 
pseudo-noise sequence ENCPNSEQ is used in the watermark sig- 
35 nal encoder and a single correspondingly time-inverse de- 
coder pseudo-noise sequence DECPNSEQ is used in the water- 
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mark signal decoder. Apart from that, the watermark signal 
encoder operates like in the first embodiment. 

The watermark signal decoder stages REC, SYNC, and DRECMF 
operate like in the first embodiment. However, the output 
signal of receiver stage REC is also fed to an echo detec- 
tion stage EDET in which echo, multipath or reverberation 
distortions are detectable and the related delays are calcu- 
lated. The delay can be estimated using different known 
methods, e.g. by correlation of the received signal with the 
unmodified decoder pseudo-noise sequence for one or more 
frames . 

In stage EDET a modified decoder spread spectrum or pseudo- 
noise sequence MDECPNSBQ is formed by shifting or multiple 
shifting the position of the original decoder pseudo-noise 
sequence DECPNSEQ according to the calculated delay or de- 
lays, respectively. The output modified decoder spread spec- 
trum sequence MDECPNSEQ is the sum of the original sequence 
DECPNSEQ and correspondingly delayed (and possibly amplified 
due to fading) versions of the original sequence, whereby 
the corresponding cut-off tails of the delayed versions are 
not considered. 

Fig. 5 shows a corresponding convolution processing output 
for a signal received without echo. The negative data bit- 
related peak at position 4096 can clearly be seen. 

Fig. 6 shows a corresponding convolution processing output 
of the same audio frame but including an echo. The main 
peak, which has an amplitude that is even greater than that' 
of the correct peak in the middle, is located at the wrong 
position and has the wrong sign or direction. 

Pig. 7 shows a corresponding convolution processing output 
of the same audio frame, which upon receipt included an echo 
but which was convolved or filtered with a correspondingly 
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modified sequence MDECPNSEQ. In this case the wrong peak has 
a significantly reduced amplitude whereas the amplitude of 
the correct data bit peak has the correct amplitude '-1' and 
is therefore correctly identified. 

In stage DRECMF the delay measurements, or the correla- 
tion/convolution results, for several (succeeding) audio 
frames are evaluated before a final result on the echo delay 
is formed. 

AS an alternative, the encoder pseudo-noise sequence section 
ENCPNSEQ_i or ENCPNSEQ, respectively, as shaped in the fre- 
quency domain can be combined with the frequency domain co- 
efficient values of the current audio frame, whereby an en- 
coded audio signal is transmitted via channel WATAUTRMCH and 
is correspondingly decoded in a watermark signal decoder. 

The pseudo-noise sequences used are calculated by a given 
algorithm based on a start value. In order to transmit se- 
cret watermark data, the start value or even that algorithm 
can be encrypted and transmitted to the watermark signal de- 
coder wherein it is used to calculate the decoder pseudo- 
noise sequences DECPNSEQ_i and the modified decoder pseudo- 
noise sequence MDECPNSEQ. 

instead of audio signals, video signals can be used corre- 
spondingly for transmitting watermark data according to the 
invention. 
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Claims 

1. Method for transmitting watermark data bits (IWATD) using 
a spread spectrum, said method including the steps: 

- modulating (BVMOD) said watermark data bits on an encoder 
pseudo-noise sequence (ENCPNSEQ) ,- 

- Transforming (WATSE) said modulated encoder pseudo-noise 
sequence (WATS) into the frequency domain and shaping it 
in amplitude according to the masking level curve of an 
audio signal together with which the watermark data bit 
information is to be transmitted or transferred, and 
transforming (WATSE) said shaped encoder pseudo-noise 
frequency domain sequence back into the time domain 

- Combining (WATSE) said inverse transformed encoder 
pseudo-noise frequency domain sequence with a current 
frame of data of said audio signal; 

- Transmitting or transferring (TRM) said combined audio 
signal frame or frames carrying said watermark data bits, 

wherein the length of said encoder pseudo-noise sequence 
(ENCPNSEQ) is one Nth of the length of a frame of said 

audio signal, N being an integer number greater one, 

wherein N orthogonal encoder pseudo -noise sequences 
(ENCPNSEQ) are used per frame of said audio signal for 

carrying out said combining for corresponding sections of 

a current frame. 



2. Method for regaining watermark data bits (IWATD) embedded 
in a spread spectrum, whereby the corresponding original 
watermark data bits were modulated (BVMOD) at encoder 
side on an encoder pseudo-noise sequence (ENCPNSEQ) and 
said modulated encoder pseudo-noise sequence (WATS) was 
transformed (WATSE) into the frequency domain and shaped 
in amplitude according to the masking level curve (PSYMC) 
of an audio signal together with which the watermark data 
bit information was transmitted or transferred (TRM) , and 
said shaped encoder pseudo -noise frequency domain se- 



PD03 0120 -Ha- 0512 03 



16 

quence was transformed (WATSE) back into the time domain 
and was combined with a current frame of data of said au- 
dio signal, wherein the length of said encoder pseudo- 
noise sequence (ENCPNSEQ) was one Nth of the length of a 
frame of said audio signal, N being an integer number 
greater one, wherein N orthogonal encoder pseudo-noise 
sequences (ENCPNSEQ) were used per frame of said audio 
signal for carrying out said combining for corresponding 
sections of a current frame, 
said method including the steps: 

- Receiving (REC, SYNC) and synchronising said transmitted 
or transferred audio signal; 

- Convolving (DRECMF) each one of a corresponding section 
of said current frame of data of said audio signal with 
the corresponding one of time-inversed versions 
(DECPNSEQ) of the N orthogonal encoder pseudo-noise se- 
quences ; 

- Determining (DRECMF), for each one of said sections, from 
the sign of the peak or peaks of the corresponding convo- 
lution result the value of a bit of said watermark data 
(OWATD) . 

3 . Method for regaining watermark data bits (IWATD) embedded 
in a spread spectrum, whereby the corresponding original 
watermark data bits were modulated (BVMOD) at encoder 
side on an encoder pseudo -noise sequence (ENCPNSEQ) and 
said modulated encoder pseudo -noise sequence (WATS) was 
transformed (WATSE) into the frequency domain and shaped 
in amplitude according to the masking level curve (PSYMC) 
of ah audio signal together with which the watermark data 
bit information was transmitted or transferred (TRM) , and 
said shaped encoder pseudo-noise frequency domain se- 
quence was transformed (WATSE) back into the time domain 
and was combined with a current frame of data of said au- 
dio signal, wherein the length of said encoder pseudo- 
noise sequence (ENCPNSEQ) corresponded to the length of a 



PD03 0120-Ha- 051203 



17 



frame of said audio signal and said encoder pseudo-noise 
sequence (ENCPNSEQ) was used for carrying out said com- 
bining for a current frame, 
said method including the steps: 
- Receiving (REC, SYNC) and synchronising said transmitted 
or transferred audio signal ; 

Determining (EDET) in the received audio signal one or 
more echoes and the related echo delays ; 
Constructing a modified decoder pseudo-noise sequence 
(MDECPNSEQ) based on the time-inversed version of said 
encoder pseudo-noise sequence (ENCPNSEQ) whereby, accord- 
ing to the echo delay or delays determined, correspond- 
ingly time-shifted versions of said time-inversed encoder 
pseudo-noise sequence are combined in order to construct 
said modified decoder pseudo-noise sequence; 
Convolving (DRECMF) said current frame of data of said 
audio signal with said modified decoder pseudo-noise se- 
quence (MDECPNSEQ) ; 

Determining (DRECMF) from the sign of the peak or peaks 
of the convolution result the value of a bit of said wa- 
termark data (OWATD) . 

Method according to claim 3, wherein the length of said 
encoder pseudo-noise sequence (ENCPNSEQ) is one Nth of 
the length of a frame of said audio signal, N being an 
integer number greater one, wherein N orthogonal encoder 
pseudo-noise sequences (ENCPNSEQ) were used per frame of 
said audio signal for carrying out said combining for 
corresponding sections of a current frame, 
and wherein, for said constructing step, the N time- 
inversed versions of said orthogonal encoder pseudo-noise 
sequences (ENCPNSEQ) for a current frame are assembled 
together before applying said combining, 

and wherein each one of a corresponding section of said 
current frame of data of said audio signal is convolved 
(DRECMF) with the corresponding section of said modified 
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decoder pseudo-noise sequence (MDECPNSEQ) , 

and wherein, for each one of said sections, from the sign 
of the peak or peaks of the corresponding convolution re- 
sult the value of a bit of said watermark data (OWATD) is 
determined (DRECMF) . 

Method according to claim 3 or 4 wherein, when determin- 
ing (EDET) in the received audio signal one or more ech- 
oes and the related echo delays, the results for several 
audio frames are evaluated before a final result on the 
echo delay is formed. 

Apparatus for transmitting watermark data bits (IWATD) 
using a spread spectrum, said apparatus including: 
Means (BVMOD) for modulating said watermark data bits on 
an encoder pseudo-noise sequence (ENCPNSEQ) ; 
Means (WATSE) for transforming said modulated encoder 
pseudo-noise sequence (WATS) into the frequency domain 
and for shaping it in amplitude according to the masking 
level curve of an audio signal together with which the 
watermark data bit information is to be transmitted or 
transferred, and for transforming said shaped encoder 
pseudo-noise frequency domain sequence back into the time 
domain; 

Means (WATSE) for combining said inverse transformed en- 
coder pseudo-noise frequency domain sequence with a cur- 
rent frame of data of said audio signal; 
Means (TRM) for transmitting or transferring said com- 
bined audio signal frame or frames carrying said water- 
mark data bits, 

wherein the length of said encoder pseudo-noise sequence 
(ENCPNSEQ) is one Nth of the length of a frame of said 
audio signal, N being an integer number greater one, 
wherein N orthogonal encoder pseudo- noise sequences 
(ENCPNSEQ) are used per frame of said audio signal for 
carrying out said combining for corresponding sections of 
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a current frame. 



7. Apparatus for regaining watermark data bits (IWATD) era- 
bedded in a spread spectrum, whereby the corresponding 
original watermark data bits were modulated (BVMOD) at 
encoder side on an encoder pseudo-noise sequence 
(ENCPNSEQ) and said modulated encoder pseudo-noise se- 
quence (WATS) was transformed (WATSE) into the frequency 
domain and shaped in amplitude according to the masking 
level curve (PSYMC) of an audio signal together with 
which the watermark data bit information was transmitted 
or transferred (TRM) , and said shaped encoder pseudo- 
noise frequency domain sequence was transformed (WATSE) 
back into the time domain and was combined with a current 
frame of data of said audio signal, wherein the length of 
said encoder pseudo-noise sequence (ENCPNSEQ) was one Nth 
of the length of a frame of said audio signal, N being an 
integer number greater one, wherein N orthogonal encoder 
pseudo-noise sequences (ENCPNSEQ) were used per frame of 
said audio signal for carrying out said combining for 
corresponding sections of a current frame, 
said apparatus including: 

Means (REC, SYNC) for receiving and synchronising said 
transmitted or transferred audio signal ,- 

Means (DRECMF) for convolving each one of a corresponding 
section of said current frame of data of said audio sig- 
nal with the corresponding one of time-inversed versions 
(DECPNSEQ) of the N orthogonal encoder pseudo-noise se- 
quences, and for determining, for each one of said sec- 
tions, from the sign of the peak or peaks of the corre- 
sponding convolution result the value of a bit of said 
watermark data (OWATD) . 

Apparatus for regaining watermark data bits (IWATD) em- 
bedded in a spread spectrum, whereby the corresponding 
orxgmal watermark data bits were modulated (BVMOD) at 
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encoder side on an encoder pseudo-noise sequence 
(ENCPNSEQ) and said modulated encoder pseudo-noise se- 
quence (WATS) was transformed (WATSE) into the frequency 
domain and shaped in amplitude according to the masking 
level curve (PSYMC) o£ an audio signal together with 
which the watermark data bit information was transmitted 
or transferred (TRM) , and said shaped encoder pseudo- 
noise frequency domain sequence was transformed (WATSE) 
back into the time domain and was combined with a current 
frame of data of said audio signal, wherein the length of 
said encoder pseudo-noise sequence (ENCPNSEQ) corre- 
sponded to the length of a frame of said audio signal and 
said encoder pseudo-noise sequence (ENCPNSEQ) was used 
for carrying out said combining for a current frame, 
said apparatus including: 

- Means (REC, SYNC) for receiving and synchronising said 
transmitted or transferred audio signal; 

- Means (EDET) for determining in the received audio signal 
one or more echoes and the related echo delays, and for 
constructing a modified decoder pseudo-noise sequence 
(MDECPNSEQ) based on the time-inversed version of said 
encoder pseudo-noise sequence (ENCPNSEQ) whereby, accord- 
ing to the echo delay or delays determined, correspond- 
ingly time-shifted versions of said time-inversed encoder 
pseudo-noise sequence are combined in order to construct 
said modified decoder pseudo-noise sequence; 

- Means (DRECMF) for convolving said current frame of data 
of said audio signal with said modified decoder pseudo- 
noise sequence (MDECPNSEQ) , and for determining from the 
sign of the peak or peaks of the convolution result the 
value of a bit of said watermark data (OWATD) . 

9. Apparatus according to claim 8, wherein the length of 

said encoder pseudo-noise sequence (ENCPNSEQ) is one Nth 
of the length of a frame of said audio signal, N being an 
integer number greater one, wherein N orthogonal encoder 
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pseudo-noise sequences (ENCPNSEQ) were used per frame of 
said audio signal for carrying out said combining for 
corresponding sections of a current frame ^ 
and wherein, in said determining means, the N time- 
inversed versions of said orthogonal encoder pseudo-noise 
sequences (ENCPNSEQ) for a current frame are assembled 
together before applying said combining, 

and wherein each one of a corresponding section of said 
current frame of data of said audio signal is convolved 
in said convolving and determining means (DRECMF) with 
the corresponding section of said modified decoder 
pseudo-noise sequence (MDECPNSEQ) , 

and wherein, for each one of said sections, from the sign 
of the peak or peaks of the corresponding convolution re- 
sult the value of a bit of said watermark data (OWATD) is 
determined in said convolving and determining means 
(DRECMF) . 

10 Apparatus according to claim 8 or 9 wherein, in said de- 
termining means (EDET) , in the received audio signal one 
or more echoes and the related echo delays, the results 
for several audio frames are evaluated before a final re- 
sult on the echo delay is foarmed. 
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Abstract 

Spread spectrum technology and the related inserted or added 
information signal can be used for implementing watermarking 
5 digital audio signals. A known processing for retrieving at 
receiver or decoder side the watermark signal information 
bit from the spread spectrum is convolving the received or 
replayed spectrum with a spreading function that is time- 
inverse with respect to the original spreading function. If 

10 BPSK modulation was used for applying the spread spectrum 

function, the output is a peak at the middle of the sequence 
of correlation values, the sign of such peak representing 
the value of the desired watermark signal information bit. 
According to the invention, in order to cope with echo dis- 

15 tortions, two or more orthogonal spreading sequences are 

used at encoder side with the original or encoded audio sig- 
nal in baseband. When applying the corresponding time- ; 
inverse orthogonal spreading sequences at decoder side, ech- 
oes that are longer than each one of spreading sequence's 

20 lengths can be fully removed. The spreading sequences ap- 
plied can be modified at decoder side according to estimated 
echo delay values. 
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